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SYSTEM ARCHITECTURE SYNTHESIS AND EXPLORATION FOR 
MULTIPLE FUNCTIONAL SPECIFICATIONS 

BACKGROUND OF THE INVENTION 

Technical Field of the Invention 

The present invention generally pertains to a method or arrangement for 
synthesizing and exploring alternative system architectures executing the functional 
specification of a system in the form of a task graph. More particularly, the invention 
pertains to a method of the above type wherein each synthesized architecture must be 
able to execute all of the tasks in each of a plurality of task specifications. Even more 
particularly, the invention pertains to a method of the above type for synthesizing a large 
pool of architectures, and then identifying the better architectxu-es therein with respect to 
prespecified criteria, in a comparatively short period of time. 

Description of Related Art 

As is well known in the art, a computer system typically has a large number and 
diversity of hardware and software resources that may be employed to execute or 
complete a particular task specification. An example of a task specification is a set of 
tasks that must be carried out in order to operate a cellular phone for wireless 
communication. Generally, many different combinations of hardware and software 
resources are available to execute a particular task specification, including combinations 
of only hardware resources, of only software resources, and mixtures of both types of 
resources. However, while the number of workable combinations may be quite large, 
some resource combinations are more usefiil than others; also some resource 
combinations may have undesirable characteristics or unmtended consequences. The 
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comparative merits of different resource combinations are generally determined by 
timing or other constraints imposed upon implementation of a set of tasks. 

As is further known in the art, architecture space exploration is the process of 
mapping a task specification to a set of resources and time schedules, wherein the set of 
tasks in the specification maybe represented as a graph of interrelated processes and 
communications. The set of resources comprises objects such as software, memory 
elements, special application specific integrated circuits (ASICs) and processors. Each 
solution provided by the mapping process has a corresponding architecture that 
incorporates a particular combination of resource components and that assigns respective 
tasks thereto. The architecture also schedules the timing of task completion. In the past, 
efforts were made to automate the process of selecting a suitable distribution of resources 
to define an architecture within the universe of possible architecture solutions known as 
the design space. Architecture synthesis algorithms were developed for automating the 
process of selecting the distribution of resources to define the architecture. In some cases 
the universe may include millions of possible architectures. Some of these efforts are 
described, for example, in "Research Strategies for Architecture Synthesis and 
Partitioning of Real-Time Systems," Jakob Axelsson, IDA Technical Report, 1996. 

In certain important applications, it is necessary to provide an architecture that is 
capable of executing multiple specifications, wherein each specification is modeled as 
a task graph. The modeling of each specification as a task graph allows designers to 
implement a single architecture that must satisfy multiple task graphs. For example, in 
mobile commimications an architecture may be required to execute both GSM (Global 
System Mobile communications) and CDMA (Code Division Multiple Access) protocols. 
Designing an architecture capable of executing both these protocols could be of 
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considerable value. However, in order to provide such a capability it is necessary to 
(1) effectively model the multiple specifications and (2) determine how the architecture 
synthesis algorithm is to explore and generate architectures for multiple specifications. 

hi another situation pertaining to multiple task graphs, a designer having multiple 
5 choices for implementing a task may prefer to have an architecture synthesis tool select 
the best choice or choices for implementing the task. For example, a designer could 
prefer to let the architecture synthesis tool decide whether to use the MPEG-3 or the 
MPEG-4 encoding scheme for video telephony. In this situation, the final selected 
architecture only needs to support the execution of exactly one of these compression 

1 0 algorithm options. Accordingly, it would be very desirable to provide an architecture 
synthesis tool for exploring better possible architectures when there is a choice of task 
implementations, and then selecting the architecture that provides the best 
implementation. Again, the two main issues to address are the way to model the 
specification, and how the architecture synthesis algorithm uses the model for searching 

15 and generating the architecture. 

SUMMARY OF THE INVENTION 

The present invention is directed to a method for synthesizing and exploring 
alternative system architectures executing multiple functional specifications, each 
20 specification having the form of a task graph. An embodiment of the present invention 
comprises a method for generating computer system level architectures that are capable 
of executing multiple functional specifications within a set of predetermined physical 
resources and that are subject to a set of design constraints. The method for generating 
the computer system level architectures includes forming an initial master task graph 
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from the multiple specifications. The master task graph has at least one hierarchical task 
that has pointers to a pliirality of sub-task graphs. The master task graph also includes 
one or more AND or XOR attributes. The initial master task graph is processed to 
provide a selected number of final master task graphs. Each of the final master task 
5 graphs include a list of the AND task graphs. A family of computer system level 
architectures are then generated for each of the final master task graphs. Each of the 
generated computer system level architectures are capable of executing all the AND task 
graphs that are included in the final master task graph. The family of computer system 
level architectures are then explored to determine their usefiilness in executing the 

10 multiple specifications. 

Another embodiment of the present invention may be an article of manufacture 
that is capable of executing multiple functional specifications, given a set of resources, 
and subject to a set of design constraints. The article of manufacture may include a 
computer readable medium, such as a magnetic tape, a floppy disk, magnetic disk, read- 

15 only memory, digital disk memories, or other computer readable mediums. The article of 
manufacture also would include a plurality of instructions, such as a computer program, 
microprocessor/microcontroller instructions, etc. stored, at least in part, on the computer 
readable medium wherein the plurality of instructions are configured to cause a processor 
to form a master task graph fi-om the multiple specifications. The master task graph 

20 includes at least one hierarchical task having pointers to a plurality of sub-task graphs, 
and one or more AND or XOR attributes. The plurality of instructions further cause the 
processor to process the intial master task graph to provide a selected number of final 
master task graphs. Each of the final master task graphs may comprise a list of AND task 
graphs. The plurality of instructions also cause the processor to generate a family of 
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architectures for each of the final master task graphs. Each of the generated architectures 
are capable of executing every AND task graph included in the list for the given final 
master task graph. The plurality of instructions further cause the processor to explore 
each of the generated architectures for their actual abilities for executing the multiple 
5 specifications. 

More specifically, the issue of modeUng multiple specifications is first 
considered for the case where a single architecture must satisfy multiple task graph 
executions. An example of a single architecture that must satisfy multiple task graph 
executions is an architecture that is required to execute both GSM and CDMA protocols. 

10 In accordance with embodiments of the invention, it is desired to have an architecture 
synthesis engine that generates architectures which can execute two specifications, each 
specification will have a task graph such as TGI and TG2. The user then creates a new 
task graph such as MGT, with one task in it. This task is a hierarchical task with pointers 
to TGI and TG2 as its sub-task graphs, and has an attribute, referred to as an AND 

15 attribute, that tells the architecture synthesis engine to treat the sub-task graphs as 

multiple specifications. The AND attribute must be set by the user. Thus, MGT is sent 
to the engine for exploration of architectures and is referred to as a master task graph. 

In a second case, where there are multiple architecture choices for implementing a 
task in a task graph. The, second case, multiple specification modeling is similar to the 

20 AND case, referred to above. For illustration it is assumed that there are two options, 01 
and 02 for task T. Accordingly, T is a hierarchical task with Ol and 02 as its sub-task 
graphs. T has an attribute, referred to as an XOR attribute, that tells the architecture 
synthesis engine to treat the sub-task graphs as a multiple choice, that is, the final 
selected architecture need only support the execution of one of the sub-task graphs. For 
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XOR sub-task graphs, the generated architectures must support the execution of at least 
one sub-task graph with or without supporting the executions of any other sub-task 
graphs at that level of hierarchy. Note that each XOR sub-task graph produces a separate 
master task graph. Each of the master task graphs can be explored individually and only 
5 the results need to be ranked collectively. After the master task graphs have been 
produced, they are each appUed to an architecture synthesis engine. The architecture 
synthesis engine may be a processor implementing an algorithm to generate and explore 
respectively corresponding architecture families. 

One embodiment of the invention is a method for exploring alternative 

10 architectures for executing multiple specifications. The method includes forming an 
initial master task graph from the multiple specifications. The initial master task graph 
includes at least one hierarchical task having pointers to a pluraUty of sub-task graphs, 
and one or more attributes that may be AND attributes, XOR attributes or both. The 
method further includes processing the initial master task graph to provide a selected 

1 5 number of final master task graphs, each of the final master task graphs comprising a list 
of AND task graphs, and generating a family of architectures for each of the final master 
tasks graphs, each of the architectures generated for a given final master task graph being 
capable of executing every AND task graph included in the list corresponding thereto. 
Each of the generated architectures is explored for use in executing the multiple 

20 specifications. 

In another embodiment, each of the architectures generated for a given final 
master task graph is placed into a pool and a particular architecture is retained in the pool 
only if the particular architecture can execute each AND task graph of the given final 
master task graph in accordance with a prespecified time schedule. Preferably, respective 
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architectures retained in the pool are evaluated by means of a fitness function and also by 
applying a convergence criteria to the architectures. Additional new architectures may be 
generated for the pool by applying a genetic algorithm to one or more previously 
generated architectures. 

5 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a block diagram illustrating modeling of multiple specifications for an 
embodiment of the invention. 
10 FIG. 2 is a schematic diagram showing a master task graph and its associated sub- 

task graphs for processing in accordance with an embodiment the invention. 

FIG. 3 is a flow chart showing an algorithm useful in exploring architectures for 
an embodiment of the invention. 

FIG. 4 is a schematic diagram illustrating genetic mutation for an embodiment of 
15 the invention. 

FIG. 5 is a schematic diagram illustrating genetic crossover for an embodiment of 
the invention. 

DETAILED DESCRIPTION OF THE EXEMPLARY EMBODIMENTS OF THE 
20 INVENTION 

hi accordance with the invention, a number of computer architectures are 

generated and explored. Each architecture must be capable of executing multiple 

specifications or task graphs. As used herein, as well as used generally in the art, a task 

25 graph is a functional specification that the resulting architecture must support. A task 

graph is characterized by a name, identifications and lists of tasks, parts and connections. 

Note that a computer architecture comprises various hardware and software components 

that are more easily selected by utilizing an embodiment of the present invention. 
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The user commences the architecture exploration process by creating a set of task 
graphs. Each task graph describes the functional behavior of one or more parts of the 
system or systems to be designed. Task graphs are described in a hierarchical fashion, 
where sub-task graphs may be related to each other by AND or XOR relationships. 
5 A hierarchical task having an AND attribute is hereinafter referred to as an AND 
hierarchical task, and its associated sub-task graphs are referred to as AND sub-task 
graphs. In hke manner, a hierarchical task having an XOR attribute is hereinafter 
referred to as an XOR hierarchical task, and its associated sub-task graphs are referred to 
as XOR sub-task graphs. 
10 After the set of task graphs are created, the task graphs are collectively combined 

into a user specification, or initial master task graph. The initial master task graph is 
passed to a preprocessor that resolves the hierarchy in the task graphs. For each XOR 
case, a final master task graph is created, as described in further detail in connection with 
FIGS. 1 and 2. 

15 Referring to FIGS. 1 and 2 together, there is shown a process block 10 depicting 

a process step wherein a user operating a processor combines multiple specifications 
together to form an initial master task graph, such as master task graph TGI shown in 
block 12 of FIG. 2. Block 12 fiirther shows an exemplary master task graph TGI 
including interconnected tasks T1-T5, wherein task T2 is an AND hierarchical task with 

20 pointers to sub-task graphs SG3 and SG5. In a like manner, task T4 of master task graph 
TGI is an AND hierarchical task with pointers to exemplary sub-task graphs SG2 and 
SG4. AND sub-task graphs SG3, SG5, SG2, and SG4 are depicted in FIG. 2 as blocks 
14-20, respectively. 
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It is to be understood that since master task graph TGI contains a number of 
AND hierarchical tasks, TGI can alternatively be represented as a set of task graphs that 
incorporate different combinations of the AND sub-task graphs. Thus, by selectively 
processing the initial master task graph TGI, as shown by process block 22 of FIG. 1, 
5 TGI can be resolved or flattened into first sets of AND task graphs TG1_1, TG1_2, 
TG1_3 and TG1_4, as follows: 

TG1_1 = TGI + SG3 + SG2 
TG1_2 = TGI + SG3 + SG4 
TG1_3 = TGI + SG5 + SG2 
10 TG1_4 = TGI + SG5 + SG4 

It will be appreciated that respective first task graph sets TG1_1-TG1_4 comprise 
different combinations of the AND sub-task graphs, together with the remaining tasks of 
initial master task graph TGI . It is appreciated further that exemplary architectures 
generated in accordance with an embodiment of the invention must be able to support 
1 5 execution of all AND task graphs in each first set. 

Referring further to FIG. 2, there is shown task T2 of AND sub-task graph SG3 
comprising an XOR hierarchical task with pointers to exemplary sub-task graphs SG6 
and SG7, depicted in blocks 24 and 26, respectively, of FIG. 2. Accordingly, the first 
task graph sets TG1_1 through TG1_4 can be further resolved into second sets of AND 
20 task graphs TG1_1_1, TG1_1_2, TG1_2_1, TG1_2_2, TG1_3_1, and TG1_4_1, as 
follows: 

TG1_1_1 - TGI + SG3 + SG6 + SG2 
TG1_1_2 = TGI + SG3 + SG7 + SG2 
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TGI. 


_2_ 


_1 = TGI + SG3 + SG6 + SG4 


TGI. 


_2_ 


2 = TGI + SG3 + SG7 + SG4 


TGI. 


_3_ 


_1 = TGI + SG5 + SG2 


TGI. 


_4_ 


.1 = TGI + SG5 + SG4 



5 As stated above, an architecture resulting from the exemplary architecture 

exploration and generation method and system must be able to execute all of the tasks 
associated with one sub-task graph of an XOR hierarchical task. However, the resulting 
architecture does not need to be able to execute tasks of other XOR sub-task graphs, 
since such tasks will not be needed if the particular architecture and its associated sub- 

10 task graph is ultimately chosen for use. Accordingly, respective AND task graphs of the 
second sets shown above must be regrouped or reordered into Usts, on the basis of their 
respective corresponding XOR sub-task graphs. The process of reordering AND task 
graphs indicated by process block 28 of FIG. 1, provides two final master task graphs 
MGTl and MGT2, each comprising a list of AND task graphs. More particularly, MGTl 

15 comprises a list of all tasks associated with XOR sub-task graph SG6, and MGT2 
comprises a list of all tasks associated with XOR sub-task graph SG7. Thus, 
MGTl = TG1_1_1, TG1_2_1, TG1_3_1, TG1_4_1, and 
MGT2 = TG1_1_2, TG1_2_2, TG1_3_1, TG1_4_1. 
Referring further to FIG. 1, there is shown process block 30 indicating that 

20 respective task graphs of a final master task graph are applied or directed to an 

architecture synthesis engine, in order to generate and explore a corresponding family of 
partitioning architectures. An exemplary architecture synthesis engine in accordance 
with the present invention utilizes a microprocessor based device and related data storage 
apparatus. The combination of the microprocessor and storage apparatus allows an 
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exemplary embodiment of the present invention to be stored in the storage apparatus such 
that the microprocessor is directed by the architecture synthesis engine. The architecture 
synthesis engine may comprise a processor utilizing software programmed to synthesize 
alternative architectures for executing a specification or set of tasks. 
5 After a family of architectures has been produced for a particular final master task 

graph, the family serves as an initial pool of system architectures. The suitability of 
these architectures, for executing respective task graphs of the particular final master task 
graph, is explored in a number of steps or stages, in accordance with an architecture 
exploration algorithm. A flow chart for such algorithm is depicted in FIG. 3. FIG. 3 

1 0 shows architecture synthesis and exploration with respect to final master task graph 
MGTl, but is equally applicable to MGT2. 

The architectiire synthesis engine executes such process as follows: (1) read 
component library, design specifications and the master task graphs MGTl and MGT2; 
(2) initiaUze the engine with parameters that dictate exploratory behavior and scope; and 

15 (3) for each master task graph generating a family of architectures, as described 
hereinafter in further detail. 

Referring to FIG. 3, there is shown an architecture synthesis or partitioning 
engine 32 employed to provide an initial pool of architectures corresponding to master 
task graph MGTl. Connectivity is completed for each architecture in the pool. As 

20 indicated by process block 34, each architecture is then tested or evaluated to determine 
whether each architecture can comply with timing requirements. More specifically, 
a usable architecture must be able to execute the tasks of each listed AND task graph of 
MGTl, in accordance with time schedules respectively corresponding thereto. 
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If an architecture successfully complies with the schedules of all the AND task 
graphs, the architecture is moved through process block 36 to a process block 38, 
described hereinafter. Otherwise, a methodology is utihzed at process block 36 to fix or 
adjust the architecture to comply with the schedule. The adjusted architecture is then 
5 routed back to process block 34 and reevaluated. This evaluation and adjustment cycle 
may be repeated a fixed number of times (N) for each architecture that did not initially 
comply with the schedules of the AND task graphs. After the Nth cycle, if the 
architecture is still unable to meet all time schedules for respective master task graphs, it 
is discarded and moved to a reject pool. 

1 0 Referring fiirther to FIG. 3, it is seen that architectures routed to process block 3 8 

are respectively evaluated in accordance with a fitness fimction. The fitness fiinction 
enables architectures to be given numerical values, to be compared with one another, and 
to be ranked. The fitness function may be designed to favor, for example, architectures 
that have lower power consumption requirements or fewer discrete components. The 

1 5 fitness histories of respective architectures are also recorded for use in ranking. 

After the architectures have been evaluated and ranked in block 38, a specified 
number of the higher ranked architectures are selected for placement in a current 
architecture pool, as indicated by process block 40 of FIG. 3 . The remaining unselected 
architectures may be moved to a reserve pool. For example, if one hundred architectures 

20 are evaluated at process block 38, the twenty most highly valued may be selected for 
placement in the current architecture pool. 

At process block 42 the current architecture pool is checked for population 
fitness, convergence and other numerical termination criteria. If the termination criteria 
is met, the current pool architectures exit the main loop and may be displayed. If the 
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termination criteria is not met, a genetic algorithm is employed to create new 
architectures for the current master task graph. 

FIG. 4 illustrates a type of genetic manipulation known as architecture mutation. 
In architecture mutations, an empty "temp" pool is initially created. One of the task 
5 graphs of MGTl is randomly selected, and a task is randomly chosen therefrom. An 
architecture is randomly selected from the current pool, and an implementation of the 
chosen task is randomly selected to be a component in the selected architecture. If the 
selected component is different from the components previously used in the task 
implementation, the selected architecture is copied and changed to the new task- 

10 implementation mapping. The architecture so formed is added to the "temp" pool as 
a temp architecture. Thus, FIG. 4 shows a selected architecture 44 mutated to a temp 
architecture 46 by replacing the component Bl with the component B6. At the end of the 
mutation process all temp architectures in the newly created "temp" pool are added to the 
ones existing in the ciurent pool. 

1 5 FIG. 5 illustrates a second type of genetic architecture manipulation known as 

crossover. In accordance therewith, a task graph of MGTl is randomly chosen. 
A random number of architectures that are less than or equal to half the number of 
architectures in the current pool is selected. Each selected architecture is then randomly 
paired with another selected architecture to form a parent pair, each member of the pair 

20 referred to as a parent. For each parent pair, a task position in the list of tasks of the 
chosen task graph is randomly selected, and the implementation of the task is switched 
between the parents of the pair. This gives rise to two new children architectures that are 
added to the "temp" pool. Thus, FIG. 5 shows a component 48 of a parent 50 joined with 
a component 52 of another parent (not shown) to produce a child 54. At the end of the 
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crossover process all new architectures in the newly created "temp" pool are added to the 
ones existing in the current pool. 

In a third type of genetic architecture manipulation, an architecture is randomly 
selected from the current pool, and a functional block is randomly selected from the 
5 selected architecture. All functional blocks in the selected architecture that are duplicates 
of the randomly selected functional block are randomly merged. Associated task 
mappings are merged to create a new architecture, which is added to the current pool. 

Obviously, other modifications and variations of the present invention are 
possible in light of the above teachings. It is therefore to be understood that within the 
10 scope of the disclosed concept , the invention may be practiced otherwise than as 
specifically described. 
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